1 |
The Telegram Chronicles of Online Harm
In: Journal of Open Humanities Data; Vol 7 (2021); 8 ; 2059-481X (2021)
Abstract:
Harmful language is frequent in social media, in particular in spaces which are considered anonymous and/or allow free participation. In this paper, we analyze the language in a Telegram channel populated by followers of former US President Donald Trump. We seek to identify the ways in which harmful language is used to create a specific narrative in a group of mostly like-minded discussants. Our research has several aims. First, we create an extended taxonomy of potentially harmful language that includes not only hate speech and direct insults (which have been the focus of existing computational methods), but also other forms of harmful speech discussed in the literature. We manually apply this taxonomy to a large portion of the corpus, including the time period leading up to and the aftermath of the January 2021 US Capitol riot. Our data gives empirical evidence for harmful speech, such as in/out-group divisive language and the use of codes within certain communities, that have not often been investigated before. Second, we compare our manual annotations of harmful speech to several automatic methods for classifying hate speech and offensive language, namely list-based and machine-learning-based approaches. We find that the Telegram data sets still pose particular challenges for these automatic methods. Finally, we argue for the value of studying such naturally-occurring, coherent data sets for research on online harm and how to address it in linguistics and philosophy.
Keyword:
computational linguistics; corpus linguistics; hate speech; linguistics; offensive language detection; online harm; philosophy; social media; Telegram
URL: https://doi.org/10.5334/johd.31 https://openhumanitiesdata.metajnl.com/jms/article/view/31

2 |
Competition, selection and communicative need in language change: an investigation using corpora, computational modelling and experimentation ...

9 |
Analysis of an Extracted Discipline-Specific Computer Science Vocabulary List

10 |
Sujeito oculto às claras: uma abordagem descritivo-computacional / Omitted subjects revealed: a quantitative-descriptive approach
In: Revista de Estudos da Linguagem, Vol 29, Iss 2, Pp 1033-1058 (2021)

11 |
A Corpus Approach to Roman Law Based on Justinian’s Digest ...

12 |
The Quest for 'Falsehood', or a Survey of Tools for the Study of Greek-Syriac-Arabic Translations ...

13 |
The Quest for 'Falsehood', or a Survey of Tools for the Study of Greek-Syriac-Arabic Translations ...

14 |
Discovering and analysing lexical variation in social media text ...

16 |
Discovering and analysing lexical variation in social media text

18 |
Automatic syntactic analysis of learner English ...
Huang, Yan. - : Apollo - University of Cambridge Repository, 2019

20 |
Detection of Longitudinal Development of Dementia in Literary Writing
In: http://rave.ohiolink.edu/etdc/view?acc_num=ohiou1524651391474684 (2018)